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ABSTRACT 

We investigate the power of void statistics to constrain galaxy bias and the amplitude of dark matter fluctua- 
tions. We adopt a ACDM cosmological scenario (inflationary cold dark matter with a cosmological constant) 
and use the halo occupation distribution (HOD) framework to describe the relation between galaxies and dark 
matter After choosing HOD parameters that reproduce the mean space density fig and projected correlation 
function Wp(rp) measuremed for galaxy samples with Mr < -19 and Mr < -21 from the Sloan Digital Sky 
Survey (SDSS), we predict the void probability function (VPF) and underdensity probability function (UPF) 
of these samples by populating the halos of a large, high-resolution N-body simulation (400/1"' Mpc, 1280^ 
particles). If we make the conventional simplifying assumption that the HOD is independent of large scale 
environment at fixed halo mass, then models constrained to match and Wp(rp) predict nearly identical void 
statistics, independent of the scatter between halo mass and central galaxy luminosity or statistical uncertainties 
in HOD parameters. Models with power spectrum normalizations trg = 0.7 and erg = 0.9 also predict very sim- 
ilar void statistics, with stronger galaxy bias compensating for weaker dark matter fluctuations in the low-erg 
model. However, the VPF and UPF are sensitive to environmental variations of the HOD in a regime where 
these variations have little impact on Wp(rp). For example, doubling the minimum host halo mass in regions 
with large scale (5/;"' Mpc) density contrast 6 < -0.65 has a readily detectable impact on void probabilities 
of Mr < -19 galaxies, and a similar change for 5 < -0.2 alters the void probabilities of M,- < -21 galaxies at 
a detectable level. The VPF and UPF provide complementary information about the onset and magnitude of 
density-dependence in the HOD. By detecting or ruling out HOD changes in low density regions, void statistics 
can reduce systematic uncertainties in the cosmological constraints derived from HOD modeling, and, more 
importantly, reveal connections between halo formation history and galaxy properties. 

Subject headings: cosmology:theory — large scale structure of the universe 



1. INTRODUCTION 

Voids are an omnipresent feature of the galaxy distribution 
jKirshner et al. 1987; de Lappai-ent, Geller, & Huchra lI99U 
l^gelev et al.i il994f iHovle & Vogelev. .2004; .Conrov et al.l 
l2005t see lRoodlll988l for a historical review). Galaxy red- 
shift surveys reveal that underdense regions 30 /i"'Mpc in 
diameter are common, and that they can be as large as 50 
/i"'Mpc. A fundamental question regarding the nature of 
voids is whether they are as empty of matter as they are of 
light, or whether the breadth and depth of observed voids is a 
consequence of a lower efficiency of galaxy formation in un- 
derdense regions. For the matter distribution, the frequency of 
large voids is sensitive to depar tures from Gaussianity in the 
primordial density fluctuations JWeinberg & ColellI992h . In 
Gaussian models, the frequency of voids depends on the rms 
amplitude of matter clustering and on the bias between galax- 
ies and mass (Einasto et al. 1991; Weinberg & Cole 1992; 
ILittle & Weinberg 1994). Little & Weinberg (1994) show that 
void statistics are sensitive not only to the amplitude of galaxy 
bias, but also to the detailed form of the relation between 
the underlying matter distribution and galaxy formation effi- 
ciency. In this paper, we revisit the connection between voids 
and galaxy bias in the framework of the halo occupation dis- 
tribution (HOD), which has emerged in the last few years as a 
powerful method for characterizing the relationship between 
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galaxies and mass fKauffmann et al.' '1997* 'Benson et al 
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EOOO; Seliak 2000; Peacock & Smith 2000; Ma & Fry 200 
Iscoccimarro et al.l2001tlBerlind & WeinbergM2002l hereafter 
BW). 

Many different methods have been propo sed for identify- 
ing voids and quantifying their importance ( Rvden & Turned 
1984; Kauffmann & Fairall 1991; Kauffmann & Melott 199 J 
lEl-Ad & Piraniil997i Hovle & Vogelev 2002). In this paper 
we will use two of the simplest statistical measures of cos- 
mic voids: the void probability function (VPF, denoted Pq) 
and the underdensity probability function (UPF, denoted Pu). 
The VPF is defined as the probability that a randomly placed 
sphere of radius r contains no galaxies of a given class. The 
UPF is similar but quantifies the probability that the galaxy 
density within a randomly placed sphere is less than a de- 
fined threshold, usually 0.2 times the mean density for galax- 
ies of that type. The VPF d epends on the full hierarchy of 
«-point correlation functions (Whitel ll979h . so in principle it 
contains complementary information to the two-point corre- 
lation function ^gir). The first significant measurement of 
the VP F and UPF was the analy sis of the CfA redshift sur- 
vey bv lVoeelev et alJ (119921 [T994 . The current state of the 
art is the analysis of the Two-Degree Field Galaxy Redshift 
Survey (2dFGRS, Colless et al. 2001) by Hovle & Vogele^ 
( l2004il) . which covered 1500 deg^ to a median redshift of 
~ 0.1. Forthcoming results for the Sloan Digital Sky Survey 
(SDSS, York et al. 2000) should yield even more precise mea- 
surements of the void distribution owing to its larger survey 
volume, which will ultimately extend over nearly a quarter of 
the sky. 

The galaxies that define the voids are, in general, a bi- 
ased tracer of the underlying mass distribution. A fully non- 
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linear model of galaxy bias is required to compare theory 
to observations for any clustering measure. The HOD for- 
malism describes bias at the level of dark matter "halos," 
where a halo is a structure of overdensity p/p^ 200, in ap- 
proximate dynamical equilibrium (identified in this paper by 
the Friends-of-Friends algorithm, see With this defini- 
tion, the mass function, spatial clustering, and velocity statis- 
tics of dark matter halos are essentially independent of gas 
physics, and they can be determined accurately for a given 
cosmology using high-resolution N-body simulations (e.g., 
IWarren e tal. 2005; Springel et al. 2005) or analytic approx- 
imations calibrated on these simulations. In the HOD frame- 
work, one specifies the bias of any particular class of galax- 
ies by specifying P{N\M), the probability that a halo of mass 
M houses such galaxies, together with any spatial or ve- 
locity biases between galaxies and dark matter within indi- 
vidual halos. To the extent that the HOD is independent of 
a halo's larger scale environment (a point that we return to 
below), this description of bias is complete, for all cluster- 
ing statistics on all scales. The HOD framework has been 
used to model measurements of the projected galaxy corre- 
lation fimctioiijji^fr^ the SDSS (Zehavi et al. 2004a, 
iTinker et aljl2005al) . The conditional luminosity func- 
tion, which combines the HOD with luminosity information, 
has been used to analyze the correlation function and luminos- 
ity function of the Two-Degree Field Galaxy Redshift Survey 
(e.g. [Yang et al. 2002, 2004; van den Bosch et al. 2003). Ap- 
proaches to modeling void statistics with the HOD have been 
discussed by Benson (2001) and BW. Here we follow the ap- 
proach of BW and calculate the VPF and UPF by populating 
dark matter halos identified in collisionless N-body simula- 
tions. 

It is well known that the halo mass functio n cor- 
relates with large-scale environment ( Bond et al. 1199 It 
iLemson & K auffmann" 19991 ISheth & Tormeni i2002l) . Mas- 
sive halos form from high-sigma fluctuations in the primor- 
dial density field, which reside preferentially in regions that 
are overdense on larger scales tiKaiser...l984.) . In low-density 
regions, high-sigma fluctuations are rare, so these environ- 
ments are dominated by low-mass halos. We therefore ex- 
pect void statistics to be sensitive to halo occupation at low 
halo masses, where the occupation number is zero or one. 
B W adopted a power-law mean occupation function for galax- 
ies above a luminosity threshold, i.e., {N)m = - 
(M/Mi)", with a cutoff at M^in below which halos could not 
house galaxies above the threshold. Their results showed a 
strong correlation between M^i„ and the sizes of voids. Here 
we adopt a more physically motivated occupation function 
that divides halo occupation into central and satelhte galax- 
ies (iGuzik & Sehakll20()2t iKravtsov etaDl2004t IZheng et al.l 
l2005h . In this parameterization, the behavior of {N)m at low 
masses is essentially determined by the relation between halo 
mass and the properties of the central galaxy. For each galaxy 
population that we model, we only consider HOD models that 
are constrained to match the mean space density Hg and the 
projected c orrelation f unction Wp{rp) of SDSS galaxies mea- 
sured by Zehavi et al. (2005). In this respect, we are investi- 
gating what additional information can be obtained from void 
statistics once these constraints have been imposed. In this 
context, we want to answer several questions: 

1. Do void statistics provide additional information on the 
minimum halo mass for a galaxy sample, within the range 
allowed by the constraints of matching the observed fig and 



2. Are void statistics sensitive to the scatter in the relation 
between halo mass and central galaxy luminosity, which alters 
the form of the cutoff in (A^)m? 

3. Can void statistics discriminate among values of trs, the 
rms linear theory amplitude of mass fluctuations in 8 /i"' Mpc 
spheres, if galaxy HODs are chosen to reproduce the observed 
correlation function? (Here h = i/o/100 kms~' Mpc~'.) 

We will show that the answer to all three questions is, for 
practical purposes, no. Once one has matched fig and Wp{rp), 
variations in the VPF and UPF that result from varying Mmin 
within the allowed range, from changing the form of the low- 
mass cutoff, or from altering erg, are all at the level of error 
expected from the full SDSS sample. We will demonstrate 
that the aspect of the HOD that has the greatest influence on 
void statistics is the fraction of galaxies that are satellites, be- 
cause once this is determined the cutoff mass M,^in is essen- 
tially fixed by fig. The satellite fraction is well constrained by 
matching Wp(rp). Therefore, within the standard cosmologi- 
cal and HOD framework, the predicted void statistics are re- 
markably robust. The imposition of the observed correlation 
function as a constraint is the crucial difference between our 
work and the analysis of B W, and it accounts for our differing 
conclusions about the information content of void statistics. 

In the standard implementation of HOD models, includ- 
ing those that lead to the conclusions mentioned above, the 
statistics of galaxy occupation are assumed to depend on 
a halo's mass, independent of its larger scale environment. 
Semi-analytical models of galaxy formation that incorpo- 
rate this assumption reproduce the observed correlations of 
galaxy luminosity, color, and morphology with environment 
(Benson et al. 2003; Norberg et al. 2001); these arise because 
the halo mass function itself varies with large scale over- 
density. Direct observation al suppor t for this assumption 
comes from the work of Blant on et alJ ll^04L who show that 
the variation of blue galaxy fraction with overdensity mea- 
sured on a 6 /i"' Mpc scale is fully explained by the variation 
on the 1 Mpc scale charac teristic of i ndividual large ha- 
los. On the theoretical side. Bond et a Us ([19911) excursion 
set derivation of the extended Press-Schechter (1974) for- 
malism predicts that halo formation histories at fixed mass 
are independent of large scale overdensity (White (1996)), 
while Berlind et al. (2003) show that the HOD for a baryon- 
mass selected sample from Weinberg et al.'s (2004) smoothed 
particle hydrodynamic simulation has no detectable depen- 
dence on environment. However, while early N-body investi- 
gations (Lemson & Kauffmann 1999; Sheth & Tormen 2004 
showed little or no correlation between halo formation his- 
tories and environment for masses above M ~ lO'"^ /i~'M,7^, 
several recent studies riao et al.ll2005t iHarker etalJl200l 
IWechsler et alJ l2005t l^iuetalT 200(h show much stronger 
correlations at lower masses. The correlation of halo for- 
mation time with large scale overdensity, which is especially 
prominent in the mass range M ~ 10" /!~'Mq of individ- 
ual dwarf galaxy halos, raises the possibility that the galaxy 
HOD itself varies with halo environment, at least in the single- 
galaxy regime. 

Void statistics should be a sensitive probe of any such en- 
vironmental variations, since they characterize extreme envi- 
ronments and are not washed out by the strong signals from 
multiple galaxy halos. In §4, we investigate a simple class of 
extended HOD models in which the minimum host mass M^in 
changes by a fixed factor in low density environments. The 
VPF and UPF prove quite sensitive to such changes, and the 
two statistics provide somewhat complementary information 
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about the density dependence of M^in. By tightly constrain- 
ing this dependence, void statistics can test fundamental ideas 
about galaxy formation and provide essential input to cos- 
mological applications of HOD methods ( Zheng & Weinberg 



2. NUMERICAL SIMULATIONS AND HOD MODELS 

The simulation used in th is paper was perfo rmed using 
the Hashed Oct-Tree code of 'Warren & Salmon' ( 1993), and 
is similar to t hose p resented in Seliak & Warren (2004) and 
IWarren et al. ( 2005). This simulation is substantially larger 
in terms of particle number, with 1280^ particles. The simu- 
lation box is 400 Mpc per side, giving a mass resolution 
of 2.5 X 10'' /i~'Mo. Halos are identified in the simulation 
by the friends-of-friends technique (Davis et al. 1985) with a 
linking length 0.2 times the mean interparticle separation. For 
low-luminosity, M,- -19 galaxies, the minimum mass halo 
roughly corresponds to a 100-particle halo. The linear power 
spectrum used to create the initial conditions was calculated 
with CMBFAST (Seliak & Zaldarriaga 1996), with the parame- 
ter set (n,„,A,(7ii,h,nb) = (0.3,0.7,0.9,0.7,0.04) and a scale- 
invariant inflationary power spectrum. We use the output at 
z = 0.459, where the linear growthfactor is 0.79, to represent a 
universe with erg = 0.7, scaling the peculiar velocities and in- 
ternal dispersions of halos to values appropriate for f7„, = 0.3 
(see Tinker et. al. [2005b] and Zheng et. al [2002] for a dis- 
cussion of this technique). 

This simulation is ideal for our purposes because of its 
high mass resolution and large box size. High mass resolu- 
tion is required to model low-luminosity galaxy populations. 
The lowest luminosity SDSS sample that we can model is 
Mr < -19."* The statistical errors in measuring void statistics 
are dominated by the finite number of independent volumes in 
the simulation (or survey), once the voids become large. The 
(400 /;"' Mpc)-' simulation volume is comparable to that of the 
volume-limited sample of bright Mr < -21 galaxies expected 
for the full SDSS. Therefore, the statistical errors in our cal- 
culations are comparable to those expected from the SDSS 
analysis. The volume of the full SDSS M, < -19 sample will 
be approximately the same as one octant of our simulation, 
and our errors are therefore smaller by ^ \/8. In practice, the 
observational errors will be somewhat larger than the error in 
periodic cubes of the same volume because edges and "holes" 
in the survey region mean that one cannot use the full volume 
for void measurements. 

To determine HOD parameters, we use the measurements 
of Wp(rp) from Zehavi et al. (2005) for galaxies brighter than 
Mr = -21 and M,- = -19. We choose these two samples be- 
cause they probe different regimes of the halo mass func- 
tion; bright galaxies preferentially occupy high-mass halos, 
while the majority of lower luminosity galaxies occupy lower 
mass halos (Zehavi et al. 2005). We use the five-parameter 
HOD model of Zheng et al. (2005), which is flexible enough 
to precisely describe the halo occupation functions predicted 
from semi-analytic galaxy formation models and hydrody- 
namic simulations. The full mean occupation function is the 
sum of the average number of central galaxies and satellite 
galaxies in halos of mass M, i.e. {N)m = (Mat)M+ (A^cen)M- 
For satellite galaxies, the mean occupation is 

All absolute magnitudes are quoted for h = I throughout the paper. For 
other values, one should add 5 \ogh. 
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where M is the halo mass, Mcut is a cutoff mass below which 
a halo cannot host a satellite galaxy. Mi +Mcut is the mass 
at which a halo hosts one satellite on average, and a is the 
power-law exponent of the relation, a number usually near 
one. The distribution of satellite galaxies about the mean is 
assumed to be Poisson, i.e. (A^sat(A^sat - 1)) = (A^sat)^- This as- 
sumption is well-motivated by both co llisionless and hydro- 
dynamic studies of halo substructure (" Kravtsov et al J 12004': 
Zheng et al. 2005), and by observational studies of galaxies 
within clusters (Lin et al. 2004). 

For central galaxies, the HOD of IZheng et al.l ll2005t) con- 
tains a soft transition between one and zero galaxies. 



cen/M — ^ 



1+erf 



logM-logMn^ 
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where erf(x) is the error function, M,ni„ is characteristic min- 
imum mass, and aiogM is a transition width. As defined in 
equation (0, M^in is the mass at which (A^cen)M = 0.5. All 
logarithms are base ten. As ciogM approaches zero, equation 
(O approaches a step function at Mmi„. Since the number of 
central galaxies is assumed to be zero or one by definition, the 
distribution of A^cen about (A^cen)M is a nearest-integer distribu- 
tion (a.k.a. Bernoulli distribution). 

For full details regarding the calculation of Wp(rp) for a 
gi ven cosmolog y an d HOD, we refer th e interested reader 
to Zheng ("2004) and lTinker et alJ (12005 a ) and the references 
therein. In the analys is of thi s paper, we have used the ap- 
proach described in Tinker et al. (2005a) with one modifica- 
tion; the parameters of the halo bias function listed in Ap- 
pendix A (fl = 0.707, b = 0.35 and c = 0.8) have been changed 
to better match those of the simulation we will be using to 
calculate Pq: a and b are the same, but b = 0.28, which lowers 
the bias of low-mass halos by about 5%. (The jackknife er- 
rors in the bias of low-mass halos in the simulation are them- 
selves ~ 5%, so we are not claiming that this value should be 
adopted universally.) The purpose of our HOD modeling is 
to find HOD parameters to populate our simulation. It would 
make little sense to make mock galaxy populations that do 
not match the analytic model or, more importantly, the SDSS 
observations. 

We assume fl,„ = 0.3 throughout this paper. For a fixed lin- 
ear matter power spectrum, the choice of fl,„ has little influ- 
ence on the mass fu nction and clustering of dark matter halos 
dZhen^ et alJl2()0l and therefore has minimal influence on 
these calculations. The shape of Punik) is well constrained 
empirically by the combination of microwave background 
anisotropics and large-scale galaxy clustering measurements 
(e.g., Percival et al. 2002; Spergel et al. 2003; Te gmark et alJ 
i2004ai) . On a more practical level, we are required to match 
the linear matter power spectrum used to create the initial con- 
ditions of the simulation, so all calculations use the cmbfast 
power spectrum with the parameters listed in §2. 1 . This Punik) 
is in good agreement with the aforemention ed observations, 
though the recent analysis of ISanchez et alJ 1.2005.) favors a 
spectrum with somewhat more large scale power. 

For a given galaxy sample, the HOD parameters are de- 
termined through minimization using the full covariance 
matrix estimated through jackknife resampling of the obser- 
vational sample (see Zehavi et al. 2005) . The number of free 
parameters in equations ([!) and (|2j is five, but it is reduced 
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TABLE 1 

HOD Parameters for Models in Figure^ 



Mr 






log Kim 


log Ml 


a 


logMcut 


-19 


0.9 


0.1 


11.54 


12.83 


1.019 


11.38 


-19 


0.9 


0.6 


11.70 


12.84 


1.018 


12.37 


-19 


0.7 


0.1 


11.52 


12.69 


1.020 


12.45 


-21 


0.9 


0.1 


12.68 


13.89 


1.063 


12.25 


-21 


0.9 


0.6 


12.86 


13.89 


1.071 


13.24 


-21 


0.7 


0.1 


12.64 


13.69 


1.107 


13.28 



Note. — All masses are in units of h ' Mq . 



to four by fixing Mmin in order to match the observed space 
density of galaxies. To populate the halos, central galaxies 
are placed at the center of mass of each halo, while satel- 
lite galaxies are placed randomly about the center of mass 
assuming a Navarro. Frenk. & White (1997) density profile 
appropriate fo r the halo's mass using the parameterization of 
iBuUock et alJ OoOl). This technique places the galaxies in 
a spherically symmetric distribution about the central galaxy, 
an assumption which has negligible importance in the context 
of this work because we are most interested in halos that host 
only central galaxies. 

3. CONNECTING VOIDS AND w,,(r„) WITH THE HOD 
3.1. Variations in the Minimum Mass Scale 

The results of B W demonstrated that the VPF is highly sen- 
sitive to the minimum mass scale, and mostly insensitive to 
the amplitude and slope of the occupation function, implying 
that Mn,in can be constrained by measurements of void statis- 
tics. We test this implication with two significant improve- 
ments over the analysis of BW: our central-satellite approach 
to the occupation function, as described in §2, and a full sta- 
tistical comparison of a given HOD model with Wp{rp) data. 

To quantify the range of M^in allowed by the observed 
'Wpirp), we first determine the best-fit values of the HOD pa- 
rameters, including Mmin, for both the M,. < -19 andM^ < -21 
samples with ctiosm fixed at 0. 1 . This value of uiogM creates an 
occupation function with a fairly sharp central cutoff, but the 
strength of the inferred constraints on Mmin will be consistent 
for any reasonable value of CTiogM- We keep triogM fixed to 
isolate the effect of changing the minimum mass scale only, 
without any change in the shape of the central cutoff. The two 
parameters that control (A^cmOm have quite different physical 
interpretations, so we examine their effect on Wpi/p) and void 
statistics separately. The best-fit values of the HOD parame- 
ters for (TioeM = 0.1 are listed in Table 1. For Mr < -19 galax- 
ies, we find Mmin = 3.60 X lO" /;"' M©, while for the brighter 
Mr < -21 galaxies, we find M^^^ = 4.78 x lO'^ /i"' Mq. For 
all calculations, we assume cg = 0.9. 

Figure ^ demonstrates that the allowed range of M^in is 
quite narrow. Panels (a) and (b) plot Ax^ for models in 
which Mn,in is varied around the minimum value for the faint 
and bright galaxy samples, respectively. As M,^in is var- 
ied, the best-fit HOD parameters are determined once again 
by minimization, but with M^in and triogM held constant 
and Ml fixed by the space density. This leave two free pa- 
rameters, a and Mcut, for the analysis. For M, < -19 
galaxies, AlogMmin = ±0.05 produces Ax^ = 6. The con- 
straints for the Mr < -21 sample are even narrower, with 
AlogMmin = ±0.015 producing Ax^ = 6. The change in x^ 



is symmetric about the minimum for both magnitude thresh- 
olds. The tighter constraints onM^n for Mr < -21 are mostly 
a result of smaller observational errors for this sample. 

The tight constraints on Mmin are a result of the information 
Wp(rp) reveals about the fraction of galaxies that are satel- 
lites. The fraction of satellites /sat influences the shape and 
amplitude of the correlation function at small scales, where 
pairs from within a single halo dominate. Once the satel- 
lite fraction is known, the value of Mmin (for a given <J\ofM) 
is determined by the number density constraint. Figures'tQ^ 
and^Jl plot the model Wp{rp) functions for the two values of 
Mmin that yield Ax^ = 6 in the upper panels (3.2 x 10" and 
4.0x 10" /r' Mofor Mr < -19, and4.7x lO'^ and 5.1 x lO'^ 
/i"'M0 for Mr < -21). HOD calculations for the high and 
low M,nin are shown with the solid and dotted lines, respec- 
tively, while points with error bars are the SDSS data. As 
Mmin increases, /sat increases in order to preserve the space 
density of galaxies, steepening and amplifying the one-halo 
term of the correlation function. This behavior is most easily 
seen in the Mr < -21 models in panel (d). The model Wp{rp) 
curves are consistent with each other and with the data until 
r < 1 /r' Mpc, which represents the transition scale between 
the one-halo term and the two-halo term. Below this scale the 
curves diverge, with the high-Mmin curve above the low-Mmin 
curve. 

The behavior for the Mr < -19 models in panel (c) is a bit 
different: the high-Mmin model has a steeper slope in the one- 
halo regime, but its amplitude is lower except at very small 
scales. In this case, the adjustment of a and M^ut after chang- 
ing Mmin provides enough freedom to lower the one-halo con- 
tribution to Wp{rp). If we keep these parameters fixed in addi- 
tion to (TiogM and the (changed) value of M,nin, we obtain the 
dashed curve in Figure Ic, which lies well above the data all 
scales. 

Figure |2l shows the void statistics for the high- and low- 
Mmin models of Figure Q The VPF is calculated for each 
galaxy distribution by randomly placing lO*" spheres of radius 
r within the box and counting the fraction that are empty. The 
UPF is calculated in the same manner as the VPF, but each 
sphere which has a mean interior density less than 0.2 times 
the mean galaxy density is counted. The error at each r is de- 
termined from the jackknife method, dividing the simulation 
volume into octants. Alternatively, Hovle & Vogelev (2003) 
estimate observational errors from the error in the mean of 
void counts expected if these follow a binomial distribution: 
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Fig. 1 .— Panel (a): Ax^ of HOD models as a function of Mniin with respect to the Mr < — 19 vvp(rp) data. All models have <j]„aM = 0.1, but the parameters that 
determine (Wsat)*/ are left free to match the data. Panel (b): Same as (a), but for Mr < —21 galaxies. Panel (c): Best-fit HOD models for Mr < -19 for the highest 
and lowest values of M^in with Ax^ ~ 6 from panel (a). The soUd line is M„^;„ = 4.0 X lO" /i"' Mq, and the dotted line is Mmin = 3.2 X lO" ft"' Mq. The 
dashed line represents the high-Mn,i„ model with the parameters of (A'sat)M taken from the best-fit model from Table 1. Panel (d): Same as (c), but for Mr < -21. 
The solid line is M,ni„ = 5.1 X lO'^ ft"' Mq, and the dotted lines is M„,in = 4.7 X lO'^ ft"' Mq. 



where A^ind. = S/^j^^^/ (A-nr^) is the number of independent vol- 
umes in the box at each r for which Pq is calculated. For the 
VPF, the jackknife errors are consistent with those determined 
from equation (|3} at radii where Po{r) w 0.01, while at larger 
radii the jackknife errors are smaller by a factor of two or 
more. At smaller radii, the jackknife errors are significantly 
larger, but the percentage errors are still less that 1%. All 
calculations are done in redshift space using the distant ob- 
server approximation, allowing us to use one axis of the box 
as the observer's line of sight and impose periodic boundaries 
for our galaxies shifted past the edge of the cube. We find lit- 
tle difference between the void stati stics calculated in and re al 
and redshift space, as also noted bv Little & W einberg ( 1994). 

For the high space density, Mr < -19 sample, the predicted 
VPF falls rapidly, dropping to Po{r) = 10"^ at r = 1 1 /i"' Mpc 
(Figure 2a). As expected, the low-Mmin model has a lower 
VPF at all radii, but because the range in Mj„i„ allowed by the 
Wpifp) and constraints is so small, the difference between 
the low- and high-Mmi,, models is small. Since our errors bars 
for this simulated sample should be smaller than those for the 



full SDSS, the difference is likely to be indiscernible in prac- 
tice. Note that errors at different r are highly correlated, as 
any large void would also contain smaller empty spheres, so 
the points do not scatter about a smooth curve by the amount 
of their error bars. For the < -21 sample, voids are larger, 
with Po(r) = 10"^ at r = 20 Mpc (Figure 2b). The tight con- 
straints on M,ni„ for this sample lead to a very small difference 
between the low- and high-Mmi,, VPFs, again smaller than the 
expected SDSS errors. 

The UPFs for the faint and bright samples are shown in Fig- 
ures |2j; and|2ll, respectively. It is more likely to find a region 
with low density contrast than a completely empty region, so 
Pu > A) at all r. The UPF is also not as steep a function of 
r, since the Ug < Q.lfig threshold allows a larger number of 
galaxies in larger radius spheres. The results for the UPF par- 
allel the results of the VPF; there is a small difference in the 
low- and high-Mmin models, with high-Mmin producing higher 
Pu, but the difference between the models is only marginally 
significant. 

BW reached different conclusions about the sensitivity of 
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Fig. 2. — Panel (a): Void probability function (VPF) for the high- and low-M,^i„ models of FigureQJ for the Mr < -19 galaxy sample. Panel (b): Same as (a), 
but for the M,. < -21 sample from FigureQl. Panel (c): Underdense probability distribution (UPF) for the high- and low-M^jn models of the M,- < -19 sample. 
Panel (d): Same as (c), but for M,- < -21 galaxies. 



the VPF to Mmin because they did not impose Wp{rp) as a con- 
straint. They varied Mmi„ by a factor of 8, producing radical 
changes in the VPF, but also changing the correlation func- 
tion. The VPF is sensitive to M^in, but Figures 1 and 2 show 
that Wp{rp) constrains M,^in tightly enough that void statis- 
tics cannot add much further leverage. This result was also 
found by Conr ov et alJ (|2p05) in their analysis of voids in the 
DEEP2 and SDSS surveys. The constraints onM^n in Figure 
^will become tighter as the full sample of SDSS galaxies is 
analyzed, significantly reducing the errors on Wp(rp). 

3.2. Variations in ag and the Central Cutoff 

The shape of the central galaxy cutoff reflects the scatter 
between galaxy properties and host halo mass. A soft cutoff 
corresponds to significant scatter, allowing some halos with 
M ^ Mmin to host a galaxy within the sample and some ha- 
los with M > Mmin to be scattered out of the sample. In the 
context of our luminosity-threshold samples the scatter is be- 
tween luminosity and mass, but it can be reflected in any 
quantity used to define the sample, such as surface brightness, 
color, or nuclear activity. 

The shape of the central cutoff may alter void statistics be- 



cause the halos that outline the voids are low mass. Numerical 
simulations of void regions show that the maximum halo mass 
within voids increases from the center of the void to its outer 
edge (Gottlober et al. 2003). Allowing some galaxies to oc- 
cupy halos below M^in could effectively "shrink" the voids. 
To quantify the effect of the shape of (A^cen)M, we fix tiiogM 
at six values ranging evenly from 0.1 to 0.6. At criogM = 0.1, 
(A'cen) M is nearly a step function at Mmin, while for tiiogM = 0.6 
the probability of finding a galaxy in a halo of mass M^in/ 10 
is still ^1%, which can be significant given the steepness of 
the halo mass function. For each value of criogM, we find the 
best-fit values of the four remaining HOD parameters by 
minimization. 

We also investigate the influence of the dark matter clus- 
tering amplitude by considering a model with (TiogM = 0.1 
but (Tg = 0.7. Power spectra with lower amplitudes produce 
smaller voids in the matter distribution, but the effect on 
galaxy voids is not immediately clear. A model with lower 
dg must have higher galaxy bias to match Wp{rp). This re- 
quires shifting more galaxies into higher mass halos, and thus 
out of low-density regions. 
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Fig. 3. — HOD fits to SDSS samples for M, < —19 (left panels) and M,- < -21 (right panels). Panels (a) and (b): Projected coiTelation functions. Points with 
error bars are SDSS data. Lines represent HOD models (see legend in panel d). Panels (c) and (d): Halo occupation functions derived from the data. The x-axis 
is halo mass and the y-axis is the average number of galaxies. 



Figure]^ plots Wp(rp) for the data for Mr < -19 and for 
three of the HOD models: (as,aiogM) = (0.9,0.1), (0.9,0.6), 
and (0.7,0.1). The two models with higher o-g are excellent 
fits to the observations and are nearly indistinguishable both 
visually and in x^- Panel (c) plots {N)m against M for these 
three models. Increasing <t\o„m produces virtually identical 
correlation functions because the bias function of halos be- 
low ~ 10'^ /;~'M0, where the differences in {N)m begin, is 
relatively flat. Redistributing galaxies from halos of 2 x 10'^ 
Mq into lower mass halos near 10" /;"' Mq does not sig- 
nificantly change the large scale galaxy bias factor (0.94 and 
0.93 for (TiosM = 0.1, 0.6). The large-scale clustering is thus 
unchanged. The differences in (A^sat)M are minimal, so the 
one-halo term is also unaltered. The only change in Wp{rp) 
from (TiogM = 0.1 to 0.6 is in the two-halo term at small scales, 
r < 0.8 Mpc, but at these scales Wp(rp) is dominated by 
the one-halo term and the changes cannot be distinguished. 
The fit for the erg = 0.7 model does not match the data as well 
and has a Ax^ of approximately five relative to the erg = 0.9 
fits. The low amplitude of matter clustering for this model can 
only be partially overcome by creating an occupation function 



that has more galaxies in high-mass halos. Changes to CTiogM 
make little difference in the agreement. 

Figure |3j) shows the HOD fits to the M,- < -21 sample for 
the same three models. All three appear nearly equivalent, but 
statistically, the (crgjCriogM) = (0.9,0.1) model has = 5.5, 
while the (0.9,0.6) model has x^ = 11-4 and (0.7,0.1) has 
X^ = 10.0. For both values of o-g, the x^ of the HOD fit in- 
creases monotonically with criogM- In contrast to the M,- < -19 
sample, moving galaxies from lO'-' M0 halos into smaller 
ones has a noticeable effect on the average halo bias of the 
sample (the galaxy bias is 1.20 and 1.15 for uiogM = 0.1 and 
0.6 respectively). The subsequent HOD model is less able to 
match the observed amplitude of galaxy clustering. 

Figure|4]shows the values of /sat determined from the best- 
fit HOD parameters as a function of uiogM, for erg = 0.9. For 
the Mr < -19 sample, /sat for all models is approximately 
0.23, with only modest variation around this mean. For 
Mr < -21 galaxies, /sat ~ 0.15 for all criogM values. For higher 
ciogM, the best-fit is also higher to compensate for the 
additional number of central galaxies occupying M < M^in 
halos. But regardless of the shape of (A'cen)M, the correlation 



8 



TABLE 2 

Predicted void Statistics form, < -19 
Galaxies 



r irl ivipcj 


"(){') 




1 


(9.50 ±0.01) X 10-' 


(9.50 ±0.00) X 10-' 


2 


(7.62 ±0.02) X 10"' 


(7.61 ±0.02) X 10"' 


3 


(5.21 ±0.02) X 10-' 


(5.21 ± 0.03) X 10"' 


4 


(3. 12 ±0.03) X 10-' 


(3.12 ±0.03) X 10-' 


5 


(1.67±0.02) X Ur' 


(2.75 ± 0.03) X 10"' 


6 


(7.95 ±0.19) X 10-2 


(2.07 ±0.03) X 10"' 


7 


(3.51 ±0.12) X 10-2 


(1.79±0.03)x 10"' 


8 


(1.39 ±0.07) X 10-2 


(1.35 ± 0.03) X 10"' 


9 


(5.10±0.35) X 10-3 


(1.08±0.03)x 10"' 


10 


(1.78 ±0.20) X 10-3 


(7.92±0.33)x 10-2 


11 


(5 .44 ±0.77) X 10-* 


(6.14±0.31)x 10"2 


12 


(1.51 ±0.36) X 10-* 


(4.68 ± 0.29) X 10"2 


13 


(2.60 ±0.95) X 10-^ 


(3.53±0.26)x 10-2 


14 


(1.00 ± 1.00) X 10-' 


(2.64 ± 0.24) X 10-2 


15 




(1.96±0.21)x 10-2 


16 




(1.44±0.19)x 10-2 


17 




(1.05±0.16)x 10-2 


18 




(7.49 ± 1.38) X 10-3 


19 




(5.24±1.17)x 10-3 


20 




(3.56 ± 0.97) X 10-3 


21 




(2.38 ±0.78) X 10-3 


22 




(1.57 ±0.62) X 10-3 


23 




(1.01 ±0.47) X 10-3 


24 




(6.31 ±3.51) X 10-" 


25 




(3.85 ±2.53) X 10-* 
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Fig. 4. — Fraction of galaxies that are satellites, /sai, as a function of crjogM 
for both M, < -19 and M, < -21 galaxies. 



function drives the HOD to the same value of /sat, reaffirming 
our conclusion from §3.1 that the satellite fraction is a funda- 
mental quantity for determining the shape and amplitude of 
the correlation function. 

Figure|5]plots the void statistics for the same HODs and lu- 
minosity samples presented in Figure 13 Panel (a) plots Po{r) 
for the three HOD models of the -19 sample. For (7g = 0.9, the 
ciogM = 0.6 model has smaller voids, as expected. Since /sat is 



the same for a\ogM = 0.1 and 0.6, the two models have equal 
number densities of single-galaxy halos, but the criogM = 0.6 
model allows some galaxies in lower mass halos that are more 
likely to be found in void interiors. However, while the sign 
of the effect is as expected, the magnitude is small, and only 
slightly larger than our statistical error bars. For example, the 
radius at which Po{r) = 10"^ changes by less than 1 /;"' Mpc. 
Results for the UPF (Figure 5c) and for the Mr < -21 sam- 
ple (Figures 5b and 5d) are similar: the criogM = 0.6 model 
has smaller voids, but the changes are too small to be readily 
detectable. 

Dashed curves in each panel in Figure 5 show the 
(crgjO-iogAf) = (0.7,0.1) model predictions. The increased bias 
almost exactly compensates for the lower matter clustering 
amplitude. There is a slightly lower UPF for the M,- < -19 
sample, but this difference is small, and the other predictions 
are nearly indistinguishable from those of the (ag, criogM) = 
(0.9,0.1) model. 

The insensitivity of void statistics to criogM implies that sub- 
Mntin halos avoid the low density regions almost as strongly as 
Mntin halos. The total number density of occupied low mass 
halos matters, but it makes little difference whether galaxies 
populate the "high end" of the low mass halo regime or pop- 
ulate it uniformly. Figure |6ldemonstrates this point with two 
pedagogically illustrative model sequences. In the first, we 
vary /sat while keeping the space density of galaxies fixed to 
that of the M,- < -21 sample. These models are no longer 
constrained to match the correlation function, and the role of 
/sat is only to determine the number density of occupied halos, 
i.e., «/, = (1 -/sat)ng- For each /sat, Mmin is set by the constraint 

and all halos M > Mmin are given a single, central galaxy. Fig- 
ure |6^ plots the VPF for these models with /sat = to 0.5 in 
steps of 0.1. For comparison, the VPF for the M,- < -21 sam- 
ple with criogM = 0. 1 is shown with the open circles. The satel- 
lite fraction for this model is 0.15, and its predictions lie in 
between those of the /at = 0.1 and /at = 0.2 models. The 
VPF monotonically increases with increasing /sat because the 
number density of occupied halos decreases. To demonstrate 
the relative unimportance of spreading galaxies over a range 
of low mass halos, we have created a model sequence in which 
/sat is held constant but (A^cen)M is set to be a constant number 
less than unity at all masses. In these models, is set by 
the constraint 

nh = I {Ncen)M-J^dM, (5) 

using the nearest integer distribution to place central galax- 
ies in halos. Figure |6|d shows the VPF for models in which 
/sat = 0.2 and (A^cen)M = 1-0 to 0.5 in steps of 0.1. In these 
models the number density of occupied halos is constant, but 
as (A^cen)M dccrcascs the average mass of occupied halos de- 
creases. The results in Figure |6l3 show a trend of lower VPF 
for lower (A^cen)M, but the amplitude of the effect is small 
compared to that of varying /at- 

These results demonstrate that models that are constrained 
to match Wp(rp) offer a remarkably robust prediction of the 
VPF and UPF, regardless of the form of the HOD or the value 
of erg (at least in the range 0.7-0.9 considered here). We 
list the values of Pq and Pu determined for the (erg, criogM) = 
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Fig. 5. — Void statistics for tlie HOD models sliown in Figure|3] Left panels present results for M,- < -19, and right panels show results for M, < — 21. Panels 
(a) and (b): Void probability functions for the three HOD models shown in Figure|3] Error bars are shown for the (erg , uiogm) = (0.9, 0. 1) models only, but are the 
same for other models. Panels (c) and (d): Underdensity probability functions for the same models. 



(0.9,0.1) models in Tables 2 and 3 for < -19 and M, < 
-21, respectively. 

4. DENSITY DEPENDENCE OF THE HOD 

Motivated by the points discussed in §1, we now con- 
sider models in which the HOD — specifically the minimum 
mass scale for central galaxies — changes systematically with 
environment. To the extend that a galaxy sample is de- 
fined by an observable property that correlates strongly with 
halo formation time, the dependence of fo rmation time on 
large scale overdensity ( Gao et al...2005>: .Harker et alJl2005t 
IWechsler et al. 2005) will drive such variations in the HOD. 
The strength of this dependence increases rapidly toward 
lower halo masses, so it is interesting to examine environmen- 
tal effects for both faint and bright galaxy samples. 

Even in the deepest voids in the dark matter, halos still form 
£jottl6beret al. 2003). Figure^ shows the fraction of halos 
below a given overdensity as a function of halo mass. For each 
halo in the simulation, the surrounding dark matter density is 
calculated in a top-hat sphere 5 /;"' Mpc in radius. Although 
the choice of smoothing radius is to some degree arbitrary, 
we adopt 5 Mpc over a larger radius because environmen- 



tal dependence in low-density regions may be sensitive not 
just to whether the halo resides in a void, but also to where 
in the void, and the local matter density increases closer to 
the edge. At 10^' /i"'Mq approximately 40% of the halos 
are in underdense regions, and ~ 10% are in regions with 
S < -0.4. These fractions monotonically decrease with in- 
creasing halo mass, until the fraction of halos in underdense 
environments is nearly zero for all halos above 10'^ Mq. 
Figure 05 plots the mass function for the halos in different 
environments. From left to right, the solid lines represent 
mass functions for halos in regions with 6 < -0.8 to 5 < 
in steps of 0.2. Each mass function can be approximated by 
a power-law at low masses with a n exponential cutoff at high 
masses ( Press & Schechte3 [T97l . The change from power- 
law to exponential occurs at a mass scale denoted M*. The 
value of for each mass function plotted in panel (b) corre- 
sponds to the mass at which the curves in panel (a) turn over 
and begin to approach zero. The arrows indicate M,^in for 
Mr < -19 (upward arrow) and Mr < -21 (downward arrow), 
indicating that bright galaxies reside in regions with matter 
densities 6 > -0.2, but that faint galaxies reside in nearly all 
environments. 
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Fig. 6. — Panel (a): VPFs for models with varying /sat but fixed overall 
space density of galaxies. From bottom to top, the 6 solid lines represent 
/sat = to 0.5 in steps of 0.1. The open circles represent the Mr < -21 results 
from Figure 121), which have /at = 0.15. Panel (b): VPFs for models with 
varying (A'cen)M, but fixed /sat = 0.2. From bottom to top, the solid lines 
represent (A'cen)M = 0.5 to 1.0 in steps of 0.1 (and are constant for all halo 
masses). The open circles are the same as panel (a). 



Our model for the density dependence of halo occupation 
is simple: below a given density threshold S^, the minimum 
mass that can host a galaxy (in the galaxy class being mod- 
eled) increases by a factor f^in- For M,- < -19, we will exam- 
ine models in which ranges from -0.8 to -0.2 and f^m = 2, 
4, and oo (i.e., galaxy formation is halted below 6c for this 
class of galaxies). A physical model would presumably pre- 
dict a smooth transition in which the probability of hosting 
a sample galaxy changes continuously with local density, but 
the quantitative effect is most likely degenerate with our two- 
parameter model. In our parameterization it is straightfor- 
ward to isolate the two characteristics of environmental de- 
pendence: the density at which a change occurs, and magni- 
tude of the shift in the HOD. 

Although the relative number of galaxies in extremely un- 
derdense regions is low, altering the galaxy distribution in 
these regions does have some impact on the correlation func- 
tion. Removing galaxies from anti-biased halos (and placing 
them in higher-density environments in order to keep the to- 
tal space density of galaxies fixed) increases the overall bias 
of the galaxy sample. This increase can be masked to some 
degree by changes in other HOD parameters. For each 6c and 
/min we recalculate HOD parameters to obtain a minimum 
fit to the SDSS Wp{rp) measurements, but we now use the 
populated halos of the N-body simulation to calculate Wp{rp) 
rather than the analytic model. We leave the value of CTiogM 
as a free parameter and start the minimization at the HOD for 



TABLE 3 

Predicted void Statistics for Mr < -21 



Galaxies 


r [h ' Mpc] 


^'o('-) 


Puir) 




1 


(9.95 ±0.01) X 10"' 


(9.95 ±0.01) 


X 10"' 


2 


(9.65 ±0.01) X 10"' 


(9.65 ±0.01) 


X 10"' 


3 


(8.99 ±0.02) X 10"' 


(8.99 ±0.02) 


X 10"' 


4 


(8.05 ±0.03) X 10"' 


(8.05 ±0.03) 


X 10"' 


5 


(6.90 ±0.03) X 10"' 


(6.90 ± 0.03) 


X 10"' 


6 


(5.68 ±0.04) X 10"' 


(5.68 ±0.04) 


X 10"' 


7 


(4.47 ±0.04) X 10"' 


(4.47 ± 0.04) 


X 10"' 


8 


(3.39 ±0.05) X 10"' 


(3.39 ±0.05) 


X 10"' 


9 


(2.47 ±0.04) X 10"' 


(2.47 ± 0.04) 


X 10"' 


10 


(1.73±0.04) X 10"' 


(3.05 ±0.04) 


X 10"' 


11 


(1.17±0.04) X 10"' 


(2.13 ±0.05) 


X 10"' 


12 


(7.51 ±0.37) X 10"2 


(1.62 ±0.04) 


X 10"' 


13 


(4.73 ±0.30) X 10"^ 


(1.49 ±0.05) 


X 10"' 


14 


(2.81 ±0.25) X 10"- 


(1.14 ±0.04) 


X 10"' 


15 


(1.62±0.19)x 10"- 


(9.82 ±0.45) 


X 10"2 


16 


(8.89± 1.31) X 10"3 


(8.15 ±0.43) 


X 10"2 


17 


(4.72 ±0.96) X 10"3 


(6.45 ±0.41) 


X 10"2 


18 


(2.30 ±0.64) X 10"3 


(5.16±0.39) 


X 10-2 


19 


(1.15±0.45) X 10"3 


(4. 16 ±0.37) 


X 10-2 


20 


(5.15±2.51) X 10"** 


(3.34 ±0.34) 


X 10-2 


21 


(2. 19 ± 1.30) X 10"** 


(2.62 ±0.31) 


X 10-2 


22 


(9.50 ±6.54) X 10"5 


(2.03 ±0.28) 


X 10-2 


23 


(3. 10 ±2.05) X 10"^ 


(1.59 ±0.25) 


X 10-2 


24 


(1.20± 1.20) X 10"5 


(1.23 ±0.23) 


X 10-2 


25 


(3.00 ±3.00) X lO"** 


(9.31 ±2.01) 


X 10-3 



the density-independent HOD found in the previous section 

forcriogM = 0.1. 

4.1. Results for Ml- < -19 

Figure |8l shows several examples of Wp(rp) for density- 
dependent models of the M,- < -19 sample. Panel (a) plots 
the correlation functions for = oo and 6c = -0.8, -0.6, 
and -0.4. The SDSS data are also shown for comparison. 
For 6c = -0.8, the change in the correlation function is neg- 
ligible. For 6c = -0.6, the exclusion of low-density halos no- 
ticeably increases the large-scale amplitude of Wp{rp). This 
effect is magnified for the 6c = -0.4 model, and the result- 
ing correlation function is clearly excluded by the data. The 
Ax^ between 6c = -0.8 and -0.6 is 1.4, while for 6c = -0.4 
the Ax^ « 10. Panel (b) in figure |8l shows the results for 
/min = 2. This more moderate model of density dependence, 
where galaxy formation is suppressed but not eliminated in 
low density environments, has a smaller effect on Wp(rp) at 
the same value of 6c- 

The effect of density dependence on Wa(r„) can be inter- 
preted through the "voided Poisson" model of Babul ( 19911) 
and Ba bul & White. C1991 ). Removing galaxies from low- 
density regions makes the voids larger. Since the galaxy num- 
ber density is held fixed, a simple random distribution of the 
displaced galaxies throughout the volume not occupied by the 
voids increases the clustering at scales less than the character- 
istic size of the voids. The stronger the density dependence, 
the higher the boost in Wp(rp). Appendix A shows that this 
simple model provides a surprisingly accurate description of 
our numerical results. 

Figure|5]shows the VPFs for o-g = 0.9 models with the three 
values of /min. In each panel, the filled circles with error bars 
show the results for criogM = 0.1 from Figure |5] The lines. 



11 



0.5 



0.4 



0.3 



0.3 



0.1 



0<^U.\J 


. . 1 . 1 . 1 . 


- 


- 6<-0.2 




- 


: (5<-0.4 \^ 






_ 5<-0.6 \ 






■ , 5<-,0.8 — . . . 1 







10 



11 



12 



13 



14 



-a 




12 

log[M,/(h- 

FlG. 7. — Panel (a): Fraction of halos that lie in regions below a given 
matter density as a function of lialo mass. Tlie top line represents tlie fraction 
of halos that lie in underdense (S < 0) regions as a function of halo mass. 
Subsequent lines present results for lower density contrasts as labeled in the 
panel. Panel (b): Mass functions of halos in low-density environments. Solid 
lines show the mass functions for halos in 5 < —0.8 regions (far left) to 5 < 
(far right), corresponding to the bottom and top lines in panel (a) respectively. 
The upward pointing an'ow marks the location of M,ni„ for the Mr < —19 
sample, and the downward arrow indicates M„,in for < — 2 1 galaxies (for 
""logM =0.1). For all density calculations, the smoothing radius is 5 ft"' Mpc. 



from bottom to top, plot Pair) for S,. = -0.8, -0.6, -0.4, and 
-0.2. (from lowest to highest). Panel (a) shows the results for 
/min = 2. For 6c = -0.8 (the lowest line), the effect on Pq is neg- 
ligible. This is to be expected from Figure 0because nearly 
all halos above are in regions above (5 = -0.8 anyway. 
For 6c > -0.6, the change in Pq is substantial; at Pq = 10""^, the 
change in radius is nearly 2 Mpc, while at 12 /z"' Mpc, the 
change in Pq is ~ 1 dex. The VPFs exhibit little further change 
above Sc = -0.6 because halos above 2M^i„ are still common 
at these densities. The effect of environmental dependence on 
Pq thus "saturates" at a density that we denote 6,- In the con- 
text of Figure0 if M^m < M^X6c) < fmiaMmia, then the den- 
sity dependence eliminates most halos that would have been 
able to host galaxies because /minA^min is on the exponential 
cutoff of the mass function for regions with S < Sc- This sig- 
nificantly increases the probability that a low-density region 
that was otherwise occupied is now counted as a void. If both 
Mmin and /minA^min are below M*, then the density dependence 
moves the minimum mass scale along the power-law portion 
of the halo mass function, but not into the exponential cut- 
off. The number of halos massive available to host galaxies is 
reduced, but the fractional change in galaxy-hosting halos is 
much smaller. Thus the likelihood of increasing the sizes of 
voids is low. In order to further expand the voids, must 
be increased. 

Figure Isj) plots the results for /min = 4. For Sc = -0.8, the 



VPF is similar to that of f^i„ = 2, and the VPF again changes 
dramatically for Sc = -0.6. Because of the higher /min, "satu- 
ration" sets in at a higher threshold density 6c ^ - = 0.4. At 
this 6c, the VPF is greatly amplified relative to the density- 
independent models; the change in the radius at which fo = 
10"'* is ^ 3.5 Mpc, and the increase in Pq at 12 Mpc 
is nearly 1.5 dex. Increasing /min further effectively removes 
the saturation effect. Panel (c) shows the results for /min = oo. 
This model evacuates regions below Sc completely, regardless 
of halo mass. At a fixed radius the probability of finding a 
void increases monotonically with Sc. 

Figure no| presents the UPF results for the same models. 
The lines and points in each panel correspond to the same 
models at Figure |9] Density dependence in the HOD has a 
different effect on the UPF than on the VPF. An increase in 
Sc at a given /min always reduces the number of galaxies in a 
region of low density, even though that region may not be a 
completely empty void. Therefore the saturation effect seen in 
the VPF is not as striking for the UPF. In panel (a), increasing 
Sc always results in a higher UPF, but the difference between 
the models gets smaller at higher Sc. As with the VPF, the in- 
crease in Pu is lai-ger for higher /min- At (/mi,,, (5c) = (4,-0.4), 
the change in r with respect to the density-independent mod- 
els is 5 /?"' Mpc for Py = 10"^. At r = 20 Mpc, the increase 
in Pu is 0.7 dex. 

4.2. Results for Mr < -21 

Our density-dependent HOD models for the brighter, M,. < 
-21 galaxies employ the same values of /min, but larger val- 
ues of Sc are required to produce a noticeable change in the 
galaxy distribution. For conciseness, we present results for 
/min = 2 only, but increasing /min has the same qualitative ef- 
fect as shown for the Mr < -19 sample. Figure fTTt plots 
Wp(rp) for Sc = -0.2, 0, and +0.2. Recall that the density- 
independent HOD fits from Figure|3lwere statistically accept- 
able, but that the large-scale amplitude of the models was 
slightly below the observed level. Boosting Wp{rp) by adding 
density-dependence to the HOD decreases for Sc ^ 0, but 
the fit becomes poor for Sc > +0.3. Figurefllti plots Po(r) for 
models with Sc = -0.4 to +0.4 in steps of 0.2. At Sc = -0.4 
there is no change in the sizes of voids. The effect of en- 
vironmental dependence does not become significant until 
Sc > -0.2. As with the Mr < -19 models, the effect on the 
VPF reaches a saturation limit at high Sc. For /min = 2 the sat- 
uration density is S^ = 0. Figure fTTt plots Puir) for the same 
values of Sc. As with the faint galaxy sample, as Pu(f) in- 
creases monotonically as Sc increases. The effect on the UPF 
also becomes detectable at Sc > -0.2. 

Figure El roughly delineates the areas in parameter space 
that produce weak and strong effects on galaxy clustering. 
The gray lines show the limit where density dependence pro- 
duces measurable changes in the void statistics. The lower 
gray line is for Mr < -19 and the upper gray line is for 
Mr < -21. Above the gray lines, there is a region of pa- 
rameter space in which the effect on Wp{rp) is weak, defined 
as Ax^ < 4 with respect to the best-fit density-independent 
model. The black Unes denote the transition between strong 
and weak influence on Wp{rp). For low /min, the region of 
weak dependence spans a larger range in Sc. As /min increases, 
the lines flatten out at Sc = -0.5 and +0.1 for the -19 and -21 
samples respectively. Below these densities there are rela- 
tively few halos above M^in; there are enough to alter the dis- 
tribution of voids but not to change the overall bias of the 
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Fig. 8. — Projected correlation functions for density-dependent HOD models of the Mr < —19 sample, (a) Models in which M,. < —19 galaxies are completely 
suppressed (/„,!„ = oo) below critical densities 5c = —0.8, —0.6, and -0.4. Points with error bars are the SDSS data, (b) Models in which the minimum host halo 
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Fig. 9. — Void probability functions for density-dependent HOD models 
of the M, < —19 sample. In all three panels, lines are density-dependent HOD 
models while points show the thi'ee models, from Figure|5j. Solid points with 
error bars represent the (crg.a-ia^M) = (0.9,0.1) model, open triangles are the 
(0.9,0.6) model, and open squares are the (0.7,0.1) model. Panel (a): Models 
with /n,in = 2. The four lines, from bottom to top, show results for Sc = -0.8, 
—0.6, —0.4, and —0.2. Panel (b): Models with = 4 for the same values of 
5c ■ Panel (c): Models with /min = oo for the same values of 5c. 




r [h~' Mpc] 

Fig. 10. — Underdensity probability functions for density-dependent 
HOD models of the M, < —19 sample. In all three panels, lines are density- 
dependent HOD models while points show the three models from Figure Ist. 
Solid points with error bars represent the (erg, ciogM) = (0.9,0.1) model, open 
triangles are the (0.9,0.6) model, and open squares are the (0.7,0.1) model. 
Panel (a): Models with = 2. The four lines, from bottom to top, show 
results for 5c = -0.8, -0.6, -0.4, and -0.2. Panel (b): Models with = 4 
for the same values of 5c. Panel (c): Models with = oo for the same 
values of 5c. 
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Fig. 1 1 . — Clustering statistics for density-dependent HOD models of the 
Mr < -21 sample. All models have = 2. Panel (a): Projected correlation 
function. Points with errors are the SDSS data from Figurel3l Line types are 
results for 5c = -0.2 (solid), 5c = (dotted), and 5c = +0.2 (dashed). Panel (b): 
Void probability functions. SoHd points with error are results from Figurelsli 
for (o-g,aia^u) = (0.9,0.1). Open squares plot the results for (0.7,0.1). The 
five lines show results for 5c = —0.4 to +0.4 in steps of 0.2. Panel (c): Under- 
density probabiHty functions for the same models shown in panel (b). The 
open squares are plotted but cannot be distinguished from the other points. 
As with the —19 samples, increasing 5c corresponds to increasing VPF and 
UPF. 



galaxy sample. 

5. SUMMARY AND DISCUSSION 

We have examined how non-linear galaxy bias, described 
in the HOD framework, affects the distribution of void sizes 
once the galaxy number density «g and projected correlation 
function Wp{rp) are imposed as observational constraints. Af- 
ter choosing HOD parameters that reproduce these observ- 
ables as measured for SDSS galaxy samples with M, < -19 
and Mr < -21, we compute VPFs and UPFs by populating 
the halos of a large, high-resolution N-body simulation of a 
ACDM cosmological model. We consider "standard" mod- 
els in which the galaxy HOD is independent of environment 
and models that incorporate a simple form of environmental 
variation. 

Our parameterization of the HOD incorporates a smoothly 
truncated step function for the mean occupation (A^cen)M of 
central galaxies and a smoothly truncated power-law for the 
mean occupation of (A^sat)M of satellites. Void statistics are 
sensitive to (A^cen)M, since halos massive enough to host mul- 



l°g(f„,„) 

Fig. 12. — Regions in the fmm~Sc parameter space where environmental 
dependence of the HOD has a readily measurable impact on void statistics 
or on the projected coirelation function, for galaxies with M,- < — 19 (lower 
lines) or M,- < — 21 (upper lines). The gray lines denote the space above 
which the effect on the void statistics is measurable. The black lines denote 
the space above which the effect on n'p(rp) is strong (Ax" > 4). Between 
the gray and black lines are the regions of parameter space in which environ- 
mental dependence is detectable with void statistics but does not significantly 
affect the correlation function. 



tiple galaxies generally reside in overdense regions. If the 
width of the cutoff in (A^cen) m is specified, then the cutoff mass 
scale Mmin is tightly constrained by the combination of Wp(rp) 
and tig, and variation of M,ni„ in the allowed range produces 
almost no change in the VPF or UPF. Broadening the cutoff 
in (A^cen)*? allows some galaxies to occupy lower mass, less 
biased halos, slightly reducing void sizes, but the effects are 
smaller than the statistical errors expected from the SDSS. 

More generally, fitting the projected correlation function 
tightly constrains the fraction /sat of galaxies that are satel- 
lites, with /sat w 0.23 and ^t ~ 0.15 for M,- < -19 and 
Mr < -21, respectively, assuming erg = 0.9. Once /sat is pinned 
down by Wp{rp), the fig constraint determines the space den- 
sity of single-occupancy halos, and because these are already 
in the mass regime where spatial bias depends weakly on halo 
mass, the void statistics are essentially fixed. When erg is 
lowered to 0.7, the HOD changes required to increase bias 
of the correlation function almost exactly compensate the re- 
duced void sizes in the matter distribution, leading to nearly 
identical galaxy void statistics. Thus, the assumption of an 
environment-independent HOD that reproduces the observed 
Wp{rp) and fig leads to a robust prediction of the VPF and UPF. 

Void statistics are, however, sensitive to differences in the 
minimum halo mass scale between low and high density re- 
gions. We have introduced parameterized models that incor- 
porate such environment-dependent HODs by changing the 
cutoff mass Mmin by a factor /„!„ in regions where the large 
scale density contrast S (defined with a 5/i~' Mpc tophat) is 
below a critical value Sc. For large values of Sc (above ^ -0.5 
for Mr < -19 and 0.1 for Mr < -21, depending on /min), 
density dependence of the HOD changes the correlation func- 
tion in a way that cannot be masked by varying other HOD 
parameters. However, there is a substantial range of if min, Sc) 
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two-point function, and these can themselves change with the 
HOD. 

Recent N-body studies show a strong correlation between 
halo for mation time and l arge scale environm e nt for low mass 
halos (Gaoetal. 2005; Barker etal. 2005; Wechsler et alj 
2005; Zhu et al. 2006) — in particular, the oldest halos avoid 
low density environments. At fixed halo mass, an older halo 
could plausibly host a more luminous central galaxy because 
of more efficient gas cooling and cannibalism of satellites, or 
it could host a less luminous central galaxy because of greater 
fading of stellar populations (see Zhu et al. 2006 for exam- 
ple calculations). Precise measurements of void statistics for 
the SDSS galaxy samples modeled here will provide an ex- 
cellent probe of the link between halo formation time and 
r-band galaxy luminosity. A strong correlation between the 
two would cause significant departures from the "standard" 
(environment-independent) HOD predictions presented in Ta- 
bles 2 and 3, while good agreement with these predictions 
would indicate that luminosity is determined by halo mass 
with little dependence on formation epoch. Extending these 
arguments and analyses to samples of lower luminosity and 
additional selection criteria such as color or surface bright- 
ness will allow detailed exploration of the links between a 
galaxy's present day properties and the assembly history of 
its dark matter halo. 



values for which density dependence of the HOD has a neg- 
ligible impact on Wp(rp) but a readily measurable impact on 
the VPF and UPF. Furthermore, the two void statistics pro- 
vide somewhat complementary information, allowing one to 
estimate separately the critical density and the magnitude of 
the shift in Mmin- 

Because they are sensitive to environmental variations of 
the HOD, void statistics can remove one source of uncertainty 
in deriving cosmological constraints from observed galaxy 
clustering. For a specific example, recall that the o-g = 0.7 
model underpredicts Wp(rp) for Mr < -19 galaxies if we as- 
sume that the HOD is independent of environment (Fig.|3t). 
Raising M^i„ in low density regions could boost the galaxy 
bias factor and thereby improve the agreement between this 
model and the SDSS Wp{rp) measurements. However, any 
change large enough to alter the correlation function at this 
level should be easily detected or ruled out by void statistics. 
There may be environmental variations in high mass halos 
that could affect Wp(rp) without altering void statistics, but 
voids are the "canary in the coal mine" for any changes in 
the large scale galaxy bias factor driven by suppression (or 
enhancement) of galaxy formation in low density regions. 

In hierarchical scaling models, the void probability func- 
tion can be written as an infinite sum of all «-point correla- 
tion functions, which can in turn be expressed as products of 
the two-point correlation function (White 1979; Frv 1986). 
I^nrov et al. (2005) argue that the VPF provides little new 
information about the HOD once Wp(rp) has been accurately 
measured because of this hierarchical scaling. While we agree 
that the VPF is essentially determined by matching Wp{rp) for 
environment-independent HOD models, we do not think that 
hierarchical scaling alone explains this result. First, one could 
create two HOD models with virtually identical void statis- 
tics but very different correlation functions simply by moving 
satellite galaxies to more massive halos while keeping cen- 
tral galaxy occupations fixed. Conversely, our ifmm,^c) mod- 
els include cases with very similar correlation functions but 
substantially different VPFs. Hierarchical scaling arguments 
correctly predict the sign of the VPF changes in Figures |9] 
and ^2 — models with higher Wp{rp) have larger voids — 
but they do not explain the magnitude of these changes. The 
relation between void probabilities and the two-point correla- 
tion function depends not on the hierarchical ansatz alone but 
on the specific amplitudes that relate «-point functions to the 

APPENDIX 

THE VOIDED POISSON MODEL 

For a random distribution of points the correlation function is zero at all r. If spherical regions of radius Ry are voided, and the 
displaced galaxies rando mlv di s tribute d through the non-void regions, a correlation is induced in the points which is non-zero at 
scales smaller than 2j^v. lBab'uil ( 119911) presents the resulting correlation function for a given void radius Ry as 
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l+Cv(r,/?v) = expJ ^ 



r 

2R^ 



2Rv 
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(Al) 



for r < 2Rv, and l+£^(r,Rv) = 1 otherwise. The parameter is the filling factor of the voids, defined as fy = AnR\ny /3>, where 
ny is the number density of non-overlapping voids. 

Originally proposed as a model for the clustering of Lyman-a absorption clouds with "spheres of avoidance" carved out by 
quasars, this model is also applicable to galaxy clustering in our density-dependent HOD models. Lowering the efficiency of 
galaxy formation in low-density regions expands cosmic voids, as demonstrated in Figures |9l and ^3 The number density of 
voids can be derived from the VPF by 



ny(Ry) = Po(Ry)V: 



Po(Rv) 



3tt^ (n'vf 
32 V 



(A2) 
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where V denotes spherical volume, Vg is the volume within which a void with radius Ry can be moved without coming into 
contact with a galaxy, and 

n{Ry)= . „2^D (^3) 
\T^RydRw 

(|Betancort-Riio"199(T; 'Patiri et al.'"2004). The clustering of the voided Poisson model assumes that the void regions are cre- 
ated from initially filled re gion s, while the voids in the density-dependent HOD models are simply enlarged. For the filling 
factor required in equation ( IA1> . we take the difference between ny inferred from the density-dependent model and the density- 
independent model. This gives us the excess number density of voids create d by the density-dependence. To calculate the 
clustering induced by spectrum of voids of varying sizes, we integrate equation JAlt over Ry; 



^\\^ i^(r^R^)Y^ dRy 
Sl^dRy 



1 + ^v(r) = ^;,„/:/^" . (A4) 



The clustering boost induced by the voids applies only to the two-halo term of the correlation; the density of occupied halos in 
the inter-void space is increased, but the pairs within a single halo are not altered. The new two-halo term is 

1 + ?2h V) = [ 1 + [ 1 + 6h(0] , (A5) 

where ^2 h(>') is the density-independent two-halo term, which is calculated analytically. 

Figure IXT^ shows the results of the model for density-dependent mode ls of the M r < -21 sample with = 4. To calculate 
ny, we use the VPFs obtained from the N-body results in equations (IA2> and (IA3> . Panel (a) plots ^(r) for models with 6^ = 0. 
The filled circles show the results for the density-independent model. This value of produces a weak boost in the large- 
scale clustering, shown by the solid line. The dotted line, showing the results of the voided Poisson model, is difficult to see 
underneath the N-body results. Panel (b) plots the same results but for more extreme models with 6c = 0.4. The effect on ^(r) is 
more pronounced, but it is still well described by the voided Poisson model. Panels (c) and (d) show A[log^(r)] with respect to 
the density-independent model for the density-dependent N-body results and the model calculations. Panel (d) plots the results 
for 6c = 0, which boosts the large-scale amplitude of ^(r) by ~ 0.06 dex. The boost abruptly goes away at r < 1.5 /i"'Mpc, 
demonstrating that the change in clustering applies to the two-halo term only. For Sc = 0.4, shown in panel (d), the boost is ^ 0.15 
dex for the large-scale ^(r). The model agrees well with the N-body results, but turns over at r > 20 /;"' Mpc. This discrepancy is 
likely a result of truncating the integral in equation (IA4> at 25 Mpc, which is the limit of our N-body calculations of the VPF. 

The voided Poisson model offers an accurate quantitative explanation of the effect of our density-dependent HOD models 
on the correlation function. Unfortunately, as presented in this Appendix it cannot be used for full analytic modeling of the 
effect of density dependence without the use of N-body simulations, which were used here to obtai n ny. The need for N- 
body simulations could be circumvented with analytic approximations of the conditional mass function (Sheth & T ormenl2Q02t 
i Qottlober et al. 2003; Patiri et al, 2004 ). but it is unclear whether these approximations have the necessary accuracy for use as a 
tool for constraining density dependence with Wp(rp) measurements. Further investigation may prove fruitful. 
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Fig. A13. — Results of the voided Poisson model of Appendix A. All panels show results for models of the M,- < -21 sample with /mm=4. Panel (a): The 
solid line plots N-body results for ^(r) with 5c = 0, i.e., Mniin increases in all regions below the mean density on the 5 ft"' Mpc scale. The dotted Hne shows the 
analytic calculation. Circles plot the N-body results for the model with no density dependence. Panel (b): Same as (a), but with Sc = 0.4. Panel (c): The solid 
line plots the difference in log§ between density-dependent N-body results and density-independent N-body results from panel (a). The dotted Une plots the 
difference between the analytic calculation and the density-independent results. Panel (d): Same as (c), but for 5c = 0.4. 
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